Highly efficient nonlinear regression for big data with lexicographical splitting
نویسندگان
چکیده
منابع مشابه
Efficient Regression for Big Data Problems using Adaptive Sparse Grids
The amount of available data increases rapidly. This trend, often related to as Big Data challenges modern data mining algorithms, requiring new methods that can cope with very large, multi-variate regression problems. A promising approach that can tackle non-linear, higher-dimensional problems is regression using sparse grids. Sparse grids use a multiscale system of grids with basis functions ...
متن کاملHighly Efficient Hierarchical Online Nonlinear Regression Using Second Order Methods
We introduce highly efficient online nonlinear regression algorithms that are suitable for real life applications. We process the data in a truly online manner such that no storage is needed, i.e., the data is discarded after being used. For nonlinear modeling we use a hierarchical piecewise linear approach based on the notion of decision trees where the space of the regressor vectors is adapti...
متن کاملNonlinear Regression With Censored Data
Suppose the random vector (X,Y ) satisfies the regression model Y = m(X) + σ(X)ε, where m(·) = E(Y |·) belongs to some parametric class {mθ(·) : θ ∈ Θ} of regression functions, σ2(·) = Var(Y |·) is unknown, and ε is independent of X. The response Y is subject to random right censoring, and the covariate X is completely observed. A new estimation procedure for the true, unknown parameter vector ...
متن کاملFast Gaussian Process Regression for Big Data
Gaussian Processes are widely used for regression tasks. A known limitation in the application of Gaussian Processes to regression tasks is that the computation of the solution requires performing a matrix inversion. The solution also requires the storage of a large matrix in memory. These factors restrict the application of Gaussian Process regression to small and moderate size data sets. We p...
متن کاملParametric Gaussian Process Regression for Big Data
This work introduces the concept of parametric Gaussian processes (PGPs), which is built upon the seemingly self-contradictory idea of making Gaussian processes parametric. Parametric Gaussian processes, by construction, are designed to operate in “big data” regimes where one is interested in quantifying the uncertainty associated with noisy data. The proposed methodology circumvents the welles...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Signal, Image and Video Processing
سال: 2016
ISSN: 1863-1703,1863-1711
DOI: 10.1007/s11760-016-0972-8